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ABSTRACT * 

The purposes of the present investigation were to 
examine the dimensionality of the Rotter Internal-External (I-E) 
scale, to test the invariance of its factor structure before and 
. after an intervention designed to alter I-E, and to demonstrate the 
use of confirmatory factor analysis. An a priori model based on the 
content of the Rotter items hypothesized six factors, and a review of 

earlier factor analyses demonstrated that each of these factors 
had been previously identified. Results from time one and time two 
both indicated that this hypothesized model was able to adequately 
explain responses to the Rotter items, %hat the hypothesized factors 
were defined and distinguishable, that the factor structure was 
invariant over time, and that alternative models with fewer 
hypothesized factors or & simpler structure did not fit the data as 
well, in a more general discussion of the Rotter scale it was 
concluded that* when judged by current standards, the Rotter l-E 
scale is a poor measurement instrument; its internal consistency ' 
reliability is minimal, the rationale for its forced-choice format is 
dubious, the assumption of its unidimensionali ty is clearly wrong, 
and the distinguishable facets of the scale cannot be inferred "with 
sufficient reliability to be practically useful. (Author) 
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1 ■ The Multidimensional ity of the Rotter 1~E Stales 

An Application of Confirmatory Factor Analysis 

ABSTRACT 

The | ir poses of the present investigation were to examine the , * 
dimensionality of the Rotter Internal -External <I-E) scale, to test the 
invanance of its factor structure before and after an intervention designed 
to alter I-E, and to demonstrate the use of confirmatory factor analysis^ 
(CFA). An a priori model based on the content of the Rotter items hypothesized ' 
six factors, and a review of 20 earlier factor analyses demonstrated that each 
of these factors had been previously identified. Results from time' i and from 
time 2 both indicated that this hypothesized model was able to adequately 
explain responses to the Rotter items, that the hypothesized factors were 
defined ar.d distinguishable, that the factor structure was invariant, over 
time, and that alternative models with fewer hypothesized t actors or a simpler 
structure did not fit the data as well. In a more general discussion of the „ 
Rotter scale it was concluded that when judged by current standards, the 
Rotter I-E. scale is a poor measurement instruments its internal consistency 
reliability is minimal', the rationale for its forced-choice format £a dubious, 
the assumption of its unidimensional ity is clearly wrong, and the 
distinguishable facets of the scale cannot be inferred with sufficient 
reliability to be practically useful. 
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, h The Hotter I-E Scale 1 

Th© Multi dimensionality of the Rotter I-E Scale: 
An Application of Confirmatory Factor Analysis 
Ib2 laUCQilzg&fcfiCMi GeilStoct As iQigrrgd From the Rotter Scale*. - 

Internal -external (XHE) locus of control is hypothesized to be a bipolar 
construct; the locus is internal if a person perceives events to be contingent 
upon his/her behavior or relatively enduring personal characteristics, the 
locus is external when events are seen to be- contingent upon luck, fate, the 
control of powerful others, tfie environment, or some characteristic not under 
his/her own control (Lef court, 1976; Rotter, 1966; 1975; Stipek & Wies2, 
1981). While a large number of I-E scales have been developed, the most 
widely used is the Ratter Scale a/id this instrument will be the focus of the 
present investigation. . 

The Rotter I-E scale consists of 23 pairs of statements, using a forced- 
choice format, and six filler questions. Each pair contains one internal 
statement and one external statement, and subjects make a dichotomous choice 
between the two alternatives. The scale is based on the assumption that I-E 
is^a relatively uni dimensional , bipolar construct. The bipolarity of 
responses to the Rotter scale is a necessary condition of the forced-choice 
„ format,, though research with other scales suggest that the construct may not 
be bipolar when independent ratings are made of interna^ and external items 
(e.g., Marsh, Cairns, Relich, Barnes & Debus, 1984; also see Collins, 1974; 
<lockars h Varnum, 1975; Zuckerman & Gerbasi , 1977). Marsh, Cairns, et al . 
a 984; also see Stipek & Weisz, 1981) also demonstrated that responses may be 
siecific to particular causes (e.g., abi 1 ity," effort) and to particular 
situations (e.g., performance in mathematics) . Nevertheless, Rotter (1966; 
1975) interpretted initial research to indicate that responses to his scale 
were unidimensional , or at least that one general factor explained moiit of the 
variance in the^total score, and this i*s how the scale has typically been used 
in I-E research. 
It)§ Present Investiga£i on.. 

The primary purpose of the present investigation is to examine the 
dimensionality of the Rotter Internal -External (I-E) scale. This will be 
accomplished by reviewing previously published studies, and by reanlayzing 
data from Marsh, Richards and Barnes (1984). Previous research has used 
exploratory facto'- analyses (EFA) , but the limitations in this approach to 
fi-ctor -analysis render it ill-suited for this purpose. Hence, «a second 

t 

purpose of the present investigation is to demonstrate the application of 
recent advances in the use of confirmatory factor analysis (CFA) for testing 
hypothesized factor structures and for testing the invariance of factor. 

♦ 

structures over different sets of responses in a reanalysis of responses to 
the Rotter scalr. . 
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The Rotter I-E Scale 2 

Data to be reanalyzed in the present investigation come from a study - 
described in more detail by Marsh, Richards and Barnes (in press). 
Participants in that study (N«361, median age - 21, 75'/.* male, 96X si-gle, 60*/. 
full-time employed, 33fc full-time students) completed the Roiter scale before 
and after the completion of the 26-day residential program called Outward, 
Bound. Though not a major focus of the study, the authors examined responses 
to the Rotter scale before and after completion of the program; coefficient 
alpha estimates of reliability were .71 and .78, and the test-retest 
correlation was .68. The Outward Bound course is designed to produce a more 
internal orientation, and posttest scores were significantly more interna 1 
than were pretest scores (p < .001). For purposes of the present, 
investigation the responses from 349 participants who completed the Rotter 
scale at the start (time 1) and at the end (time 2) of the 26-day program are 
considered. 

The design and the interpretation of findings from the original study are 
not the focus of the present investigation, but two assumptions underlying the 
analysis of the Rotter scores will be investigated further. First, the use of 
a single total score was based on the assumption that responses to the scale 
are relatively unidimension-i; and this assumption seems dubious. Second, the 
comparison of pretest and posttest scores — for a single dimension or for 
multiple facets — assumes that the factor structure was relatively invariant 
across the two administrations of the scale. If this invariance cannot be 
demonstrated, then the constructs being measured at the pretest and posttest 
differ, and scores representing these constructs cannot be meaningfully 
compared. This assumption may be particularly problematic in a study where 
the intervention is sper fically designed to alter the I-E orientation, and 
may cause some facets of the construct to become more salient. It should be 
noted that a similar assu^ntion of factorial invariance also underlies the 
meaningful comparison of responses by randomly assigned experimental and no 
treatment control subjects, or the comparison of responses by distinct 
subgroups such as males and females, blacks and whites, etc. 
ID§ PiffiensionaUty. of the Rotter I = E, Scale^ 

,IbS Number of Factgr§l A critical initial step in all factor analytic 
studies is to determine the. number of factors needed to explain the responses. 
In preliminary analyses conducted for ths present investigation, empirical 
approaches were used to determine the number of factors needed to describe 
responses at time 1 and at time 2. While a detailed examination of these 
approaches is beyond the scope of this study (see Crawford,, 1975; E.erett, 
1983; Hakstain, Rogers «, Cattell, 1982} Kim & Mueller, 1978} Marsh ?, Hocevar, 
1984a), three approaches are most fr^ently used: the number of eigenvalues 
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The Rotter IE Scale 3 
greater than 1.0 (Kaiser,, i960); Cattail's (1966) scree test; and a chi -square 
test of statistical significance (the one used here ; is based on Joreskog's 
approach to aaxiaua likelihood factor analysis as psrformed by the 
commercially avaiUule SPSS prograrai see Burns, 1976). The. results from each 
of these approaches for responses from tine 1 and time 2 (Table 1) all • 
indicate that the Rotter scale is cU-Iy multidimensional , and that between 4 
and 9 factors are "eeded^to ex plain responses to the scale. 

Insert Table 1 About Here 
The results from previously published studies where at least one set of 
responses to th. original Rotter scale has been factor analyzed are summarised 
in Table 2. The authors of each of these studies argue for the 
multi dimensionality of the Rotter scale. In apparent contradiction to the 
findings from the present investigation, roost studies report only two or three 
factors, though some report more. However, the studies that report only two or 
three factors typically estimate the number of factors that are needed on the 
basis of either an incorrect application of the eigenvalue rule that 
"substantially underestimates the number of factors that are needed (see 
footnote i), or on the intarpretabil i ty of the identified factors. While the 
interpretability o< factors is an important issue, it may^not be an adequate 
basis for determining the dimensionality of a scale. The fact that both 
factors in a two-factor solution can be interpretted does not mean. that there 
are no aoditional factors. Even when additional factors cannot be readily 
interpretted, it does not mean that a smaller number of factors can adequately 
explain the data. The issue of the number of' factors is an important 
methodological issue that has not been given sufficient consideration in this 
research. . » 

Though the studies in Table 2 typically did not' report any of the 
empirical procedures listed instable 1, they often did present the proportion 
of variance that was explained jy the factors that were identified (see Table 
2). In* 19 EFAs of responses to the original Rotter items, between 87. and 207. 
(median * 13% vs. 14% & 187. in the present investigation) of the total 
variance is explained by the first factor (see footnote l), between 5% and 10% 
(median = 87. vs. 77. t* 97. in the present investigation) of the total variance 
is explained by a second factor, and between 57. and 87. (median « 77. vs. 64 & 
6% in the present investigation) of the total variance is explained by a third 
factor. In no study did two factors explain more than 307. of the variance, 
nor did three factors explain more than 357.' of the variance. Furthermore, in 
all studies that reported 'the eigenvalues, or whore they could be inferrei, 
there were at* least five greater than 1.0. These findings indicate that there 
is good agreement between the results of the present investigation and 
previous research in terms of the proportion of variance that can be explained 
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by one, two and three factor*, and that part of the apparent contradiction^ 
the number of factor* is an artifact of the manner in which the number of 
factors was estimated in previous research. 

Insert Table 2 About Here 
, — 

IU8 QSQtenfc of Factors identified ^n EFA ituchgs^EFAs of the Rotter 
items summarized in Table 2 typically identified two or more of the following 
factors: general luck (often called general control , control ideology, or 
fate), political control, success via personal initiative («f ten called 
personal control), interpersonal control, control in -an academic situation, 
and. control in an occupational setting (sometimes called success mooilitv). 
Mirels (1970) described,* two-factor solution consisting of a general control 
(called general luck here) and a political control factor, and, factors like 
thes« are reported in most of the EFAs in Table 2). GuVin, Gurin, Loa and 
Beattie (1969), though not based on the original set of Rotter items, reported 
a personal control factor in addition to the luck and political control 
factors. Sanger and Alker (1972) described this factor as tantamount to 
endorsing the view that one's own efforts are responsibie for one's success or 
failure, and it is called success via personal initiative in 'the present 
investigation. While such a factor has apparently been found in several 
studies that consider only the original set of Rotter items, its 
interpretation is sometimes tenuous. The factors characterized here as s 
general luck and success via personal initiative are sometimes difficult to 
distinguish, and may have a considerable overlap in the items that define 
them. Furthermore, when the two are not identified as separate factors, th^y 
are typically incorported into one factor that is usually called, general 
control that appears to have components of each. This situation is exacerbated 
by the fact that many Rotter items ask respondents to choose between 
statements that refer to luck and success due to pprsonal initiative. 

Abrahamson, Shulderman and Shulderraan, (1973) reported a three-factor 
solution, but their third factor was an interpersonal or social factor that 
has been identified in a number of other .EFAs. Particularly when more than 
three factors are considered, some researchers have found an academic control 
factor (e.g./ Garza S«-Widlak, 1977; Little, 1977; Watson, 1931; and perhaps 
O'Brien & Kabanoff, 1981). Finally, an occupational factor has been identified 
in some EFAs (O'Brien «< Kabanoff 1981; also see Gurin, Gurin J, Morrison, 
1978), though the factor is defined by only two items and thus is necessarily 
weak. While most researchers attempted to interpret only two or three factors, 
four studies have identified either five (Garza C Widlak, 1977; Little, 1977, 
Watson, 1981) or six (O'Brien & Kabanoff, 1981) factors that are apparently 
similar to those described here. 
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The Rotter. I-E*ScaJe S 
Despite the apparent consistency of the findings summarized in Table 2, • 
the conclusion drawn by Dixon, McKee and McRa* <1976, p. 318) is still valid: 
even though researchers often use similar descriptive labels for their 
factors, the actual item content of their factors may vary dramatically fro® 
study to study. This is particularly true in these EFAs in that: mast 
loadings are small, some items do not load substantially or, any factor, some ' 
items load on more than one factor^and the size of a loading used to decide 
whether or not an item loads significantly on a particular factor is typically 
arbitrary. 

In reviews of this factor analytic research, l.ange and Tiggemann U931; 
also sec Cherlin & Bourque, 1974) suggest that the use of the Rotter I-E items 
in the context of additional items, and its administration to highly selected 
samples may influence' the .instrument's factor structure. Many researchers have 
attempted to compare factor analytic results obtained from different sets of 
responses, or to compare i^eir factor solution with those obtained in other 
studies. Comparisons of different factpr structures within a single study are 
most frequently made for response* by males and by females, by a general 
population and by some more narrowly defined subgroup, or by the same group of 
subjects on more than ocve occasion. . Methbdologi cal limitations inherent in the 
use of EFA for purposes of testing factorial invariance in such comparisons 
are discussed latter, and render the results of such exercises as problematic. 
Neverthele9s, most researchers suggest that at least some, and in some cases, 
al', of thsir factors generalize across different analyses. 

Other researchers have' factor analyzed responses to the scales where thV 
23 forced-*choice Rottrr items are presented as 46 Likert-type items*, thus * * * » 
eliminating some of the problems inherent with the forced-choice format" (e.g., 
Collins, 1974; Klockars & Varnum, 1975; Zuckerman & Gr,rbasi, 1977). While 
such an approach is reaspnable, and these results appear to De consistent' 
with the general findings described here, a detailed examination of these 
studies is beyond the scope of the present investigation. 

Since studies summarized in Table 2 include only those that used the 
original Rotter items, the important study by Gurin, et al . , (1978) was n$| 
included. They eliminated the three^Rotter items that refer to academic' 
situations and included three additional items in a study based on a large 
representative sample of the. general adult population. In each of the separate 
analyses of responses by white men, by white' women, and by blacks they found 
seven eigenvalues greater than 1.0, but chose to rotate five factors. While 
there was not complete agreement in the factors identified in their three 
sutpopulations, they report five factors (control ideology, political control, 
personal control, interpersonal control, and sue :ess mobility) that are 
similar to five of the six (general luc.k, pol i ti :al , control , success via 

6 ' 



The Rotter I-E Scale L 



personal initiative, interpersonal control, and occupational situations) 
summarized ear War; they did not find an academic factor since they had 
excluded the items that would have identified this factor. While they were 
able to clearly identify these distinguishable factors, the coefficient alphas 
for their scales were consistently low ( .41 to .65, mean = .55/, due at 
' least in part to the short length of some of the scales. They then went on to 
relate different facets of the the l~E construct to a variety of other 
constructs in an attempt to explore their convergent and discriminant 
validity, "thus, while the use of a somewhat different ret of items may call 
into question of generality of this study to other research summarized here, 
it represents an important contribution to the study of the dimensionality of 
the I-E construct. 

Iht @QQll£d|ign 9i QSQiiCffiStgry. factor Analysis <CFA). 

< In each. of the studi ^.described earlier researchers have used 
exploratory faqtor analyses (EFA) to examine the factor structure of responses 
to the Rotter I-E scale for a single group, or to compare factor structures 
derived from different sets of responses. While such an aporoach may be 
heuristic and suggest plausible factors, it is generally unacceptable. With 
EFA the researcher has little control over the factor structure to be tested 
beyond the number of factors to be rotated and perhaps the degree of. 
correlation among the factors. Hence, the researcher is unable to test the 
ability of a hypothesized structure to fit the data and is limited to an .a 
•posteriori interpretation of the factor structure that is derived by the EFA 
procedure. Since the EFA solution is not identified, there are an infinite 
number of alternative solutions that are mathematical ly equivalent (i.e., that 
fit tne equally w 11 and differ only in terms of the rotation of the % 
dimensions) and some may result in qui^e different interpretation!: of the 
underlying factors. EFA provides no indication of how well a hypothesized 
factor structure would be able to fit the data, nor how alternative structures 
compare in their ability to fit the data. Finally, \the inability to define 
the model to be tested for any one set of responses makes the comparison of 
solutions derived from different sets of responses so problematic that Alwin 
and Jackson argue that "the use of exploratory factor analysis in its 
conventional form to examine issues rf factorial l.nvariance is of limited 
utility " (1981, p. 253; also see farsh I Hocevar, 1984a). In contrast to 
EFA, CFA allows the researcher to: defihe and test a hypothesised factor 
structure; uniquely estimate the parameters used to define a hypothesized 
model; examine a model's ability to fit the data; compare the goodness-of -f i t 
for alternative models; and test the invanance of all parameters, or any 
Subset of parameters, used to define a model across different sets of 
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responses. (For a sore detailed comparison of EPA and CFA, and a general 
introduction to CFA, see Bago^i, 19SO; Huba & Dent i or, 1902; h>rsh & Mocevar, 
1983; 1984a; J984bi in press; Jofes^og, 1971; i960; Jcrpsfeog Sorbos, 1981; 
Long, 1983a; aJ983b; and Pedhauaur, 1982). 

I&S CctmoQ gf iq Q Priori CFA, unliKe £FA, begins with a 

specially defined model. When the factors that an instrument is deigned 
to Measure are well -define 4, then the design of the instrument serves as one a 
priori basis for defining the factor structure. For example, a single-factor 
model where all items are allowed' to load on one factor is consistent with the 
design of the Rotter scale. However, the formulation of models.: when there is 
no clearly defined a priori model, or when alternative models are sought, is 
not straight -forward. One possible approach is.to first use EFA to 'Suggest 
viable structures and then to test theses-structures with CFA. • Alternatively, 
past research and the content of the items can be used to formulate 
alternative models. 

In the .present investigation, an inspection of the 23 Rotter iteUHKby the 
first author suggested six possible factors: General Luck,, Political Control, 
Success Via Personal Initiative, Interpersonal Control, Academic Situations, * 
and Occupational Situations. The first four repcesent different* sources of 
control, each 3 appeared to be represented by at least' five items, and each 
appeared to be similar to factors identified in previous research. The last 
two factors are defined by specific situations rather than sources, of control; 
they are apparentl /.represented by 'only 3 and 2 items respectively, and they 
also have been identified in previous research. In order Jto generate an a 
priori model, the first author and three coll&agues independently c lassi f i ed f 
each^item into one or more of these six 4 categories. (Since each Rotter item 
consists of two separate statements, many items cannot be unambiguously to 
categor^ed into just one category and when an, item was classified into more 
than one category, thu category that best represented the item was also 
indicated^) Agreement among the four sets of classifications varied from 85*4 
to 95*/. <mn = 907.), Based on these responses, a. Single -"best" category and • 
secondary categories (any category indicated by at least two of the four 
raters) ^were designated for each item. The results of tnis process provided 
good support for .five, and some support for the sixth, of the six proposed 
categories; the Occupationa\ Situation factor was defined by only two two 
items and neither of these items were percnved as best represented by this 
category by any of the raters. The results of this exercise were used to 
specify a series of a priori models that hypothesized 1 to 5 factors to " 
explain responses to the* Rotter items (aee Appendix I). 

• ' Ib§ Definition of CFA^ Models., In CFA performed by LISREL , alternative 
models are specified in terms of three design macricos that are conceptually * 

/ 0 
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*f - ftimUr to matrices resulting from EFA: , LAMBDA Y, a matrix of factor loadings! 

: a factor variance/cover iance aatPix that represent* relation* among the 

factors;^ and TMETA EPSILON, a .matrix "that 'contains err or /uniquenesses - 
*i,iular to one minus the communality estimajes in EPA - in the diagonal, in 
trie definition of each «odel, every clemcntUn these *at r it** ist a) fixed at 
a predetermined value, usually 0* or. 1,; b> treeiy estimated a* part of the 
analysis; or, c) constrained such that two elements have the sane value <this 
type of constraint is used to test for the jnvarrance of estimates for two 
different sets ^f responses as described bellow*. For purposes of the present 
investigation, alternative models proposed 1, 2, 3, 4, or 5 'actors 
corresponding to the categories described earlier (see Appendix 1). For all 
but the one-factor model, simple and compttL structures we?re examined. For 
simple structures each item w*s allowed to load on only one factor, while for 
complex structures some items were allowed) to load on core than one factor as 
specified in the a priori models. . In ordJr to define each model, one item Mas 
selected to serve as a reference indicator) for each factor; its factor loading 
was fixed to-be 1.0 on that factor and itjwas not all^wea to load on any other 
factor (see Long, 1983a, pp. 49-5S for further discu ,ion of reference 
' loadings).. . ; » \ . 

§9S^Q8§§~Bf~iiti A plethora of mdicfes of a model's goodness-of-f it are 
,uscd in CPA, but there are no well established guidelines for what minimal 
condit ions constitute an adequate goodneis^of-f i t. In general » a) the 
par * aGt,?r estimates are examined in relation to the substantive model and 
permissible values* b) the c*u -square va^ue for a model is evaluated in terms 
of statistical significance and compared 1 with that, obtained for alternative . 
. i models; and, c) 'subjective, indices of gpodness-of -f it are examined and 

compared with values obtained for alternative models. In contrast to 
traditional signi/f icanCe testing,' the r jjs^archer «ay prefer ' a nonsigif leant 
eh; -square' test of statistical significance that indicates that a proposed 
, model is able to fit the observed data within the liaiits of chance.' However, 
^ , there are problems with this test. Firjst, the test .is highly sensitive to 
, j departures from multivariate normality.: Second, in large complex problems 
\ whero *»2es> and the number cjf measured variables are large, the 

' test is so powerful' that the c«T~square will generally be statistically 
significant even when a model provides a reasonably good fit to the data. 
/ Hence, most practical applications of CFA require a subjective evaluation of 
whether a statistically si gni f i cant, chi -square A n small enough to constitutes 
an adequate' fit. 

Many subject>ve indices of goodness~Of-f i t have been developed and some 
of the most frequently used have been calculated for models described in this 

.er|c . fj 
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study. A detailed examination of these indices is beyond « he scop© of thi» 
studyXtsee.Bentler Bonett, 1981 j Cliff, 1983| PornoH, |983| JoresfeG* & 
Sorboe, 19Q1| Long. t9B3*i 1983b! Marsh I, Heeevar, !984«j i.V84b» in press? and 
discussion Mill be I i si ted primarily to the ehi -square/dl ratio and the 
Tucfccr-Umu Index <TLI>. Researchers generally interpret K ? /df ratio* of 
less than 2.0 to represent an adequate fit, but the value' of Ult»^«tto will 
increase with sample sue independent of the abiUty of the model to fit the 
data. Because of this problem, alternative indices such as the Til were - 
devised that are not dependent on taeple >izc. the TU scales the observed 
chi-sqtjare along a 0-1 scale Khere 0 represent* the chi- square obtained free a 
.null oodel and I repre&ent* an ideal fit. The logic of this index- is SieiiaV 
to indices such at eta and omega that are frequently used in ANUVA as m 
indication of the proportion of variance explained, Since there is not wide 
agreement for evaluating the absolute values of these different indices* they 
are sore useful, perhaps, as a oasis for comparing alternative eodels used to 
describe the same set of responses. 

Insert Table 3 About Here' % 
Pie BttSUitft Qi dlUCQtUYt dgSfiiSt The results of alternative mcd*ls are 
summarized in Table Z. ■ In general, codels that hypothesise more factors are 
. better able to fit the data than models that propose fewer factors, and 
complex .uodeis ar^ better able- to fit thx« data than simple eodels. This 
t pattern is somewhat stronger for data from time 2 than for tiee 1. 
Nevertheless, for bath time I and time 2, the five-factor model with a complex 
structure (Model 5B> is better able to fit the data than eodele that 
hypothesise fewer factors or £hat propose a simple structure of factor 
loadings. The chi -square for Model SB is statistically significant for data 
tor time 1 and tie* 2, but the goodness of -f i t indices tfkggest that the model 
is able to adequately fit the data. In particular, despite the relatively 
Urge saraple sue, the xt/df ratio*, (i.24 for tic* 1, and 1.3S for tiee 2 I 
.are much small or' than 2.0. Although thp cht square value lor Node! fi8 i* 
slightly smaller lor tiae i than time 2, the TLI is better for time 2. Tbss< 
occur* becaube the average covariation among items is larger for" time 2 than 
time !, and thus there is more variance' to be explained in these responses. 

M sixth factor, Occupational Situations, was originally proposed, but 
*ip*» only two items (9 and.} 3) classified into this category and neither 

was perceived to be primarily determined by this factor, it could not hi? 
adequately defined. However, this t f >c.tor \ s represented in Model b>' 
allowing the error /uniqueness terms dn f He TA EPSU.0N) tor these two items to 
be correlated. Cven though Model SB prodded « good fit to the tUta, end only, 
one additional parameter is, estimated in Model 5C, the chi -square for Model 3C 

is significantly better. This means that in both sets of response* the two 

i n 
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occupational item* are aor© highly correlated with each other than' can be 
explained ui ter*s of the ftve^ actors. This is interpret ted aft support for 

Occupational factor, though a»< alternative interpretation would be that 
these two itees are influenced by a «©thod/respon?it? bias produced by the 
reference to an occupational setting that appear* it* both ite*s. 
I SilS 8l CaSlStiaJ. AQ^CUOSSa 

fctash of the codecs eueaar ized its Tabic 2 was tested in teres of its 
ability to at th» data for tie* t and for time 2. hodel 5L provided the beet 
fit to the data for both sets of responses, but none of the parameter 
estt nates were constrained to be equal across tie* i and ti«e 2 in order to 
test the invar tance of the hypothesised structure. For purposes of the 
present investigation, the mvariance of Models 1 and SC is tested. Model SC 
provided the best fit to the data, and so it is relevant. However, since aost 
research uses a single total score to summarize responses to the Rotter scale 

the «-*uVe w. <Af!.t with the original design of the instrument, tests' 
of the in variance of the one-factor »odel say also be relevant. 

For tests of the -variance of the factor structure over two different 
* oU °* r «?*Pon*es, the estimated value* for any single parameter, for any 
subset of parameters, or for all parameters can be constrained to be equal in 
solutions representing each set of responses. The L1SRSL procedure fits the 
data subject to these equality constrain , and an evaluation of the goodness- 
of -fit indices is used to infer how dc «©ntai the constraints are to the fit 
of the data (for a awe detailed diseus».. 3 n of factorial invariance see Alwin 
I Jaivsoi., 1961$ Everett Roy, Sv8l| Joreskog «. Sorboss, 1981; Marsh & 
Kocevar, in press} l"B4a>. To the extent that a »odel with equality 
constraints is able ta fit the data nearly as well as a *odei without these 
constraints,, there is support for the invar iance of the constrained 
oaraaeters. It should be noted that the chi -square value for a model with 
equality constraints can never be smaller than that of the corresponding 
unconstrained eodei , and that the chi-suuare values will only be the same if 
the parameter estimates under consideration *re onactly equal when no 
constraints are imposed. In this sense, the unconstrained aotfel represents an 
upper littit, or target, for the goodness of -fit for the constrained eodel. An 
alternative mde* of goodness of -fit for the constrained model, the target 
coefficient UC> , is the ratio of the chi -square values for a constrained and 
an unconstrained eodeis. Tm, TC provides an est»»ate of the proportion of 
.-ariance m the unconstrained code I that a model with equality constraints is 
*01« to e*pia»n Table 4, d | 00 see Marsh L riocevar, m press, for * 

ti«ilar derivation). 

In the present investigation, tour tests of factorial invar iance were 



The Rotter I-E Scale 11 
conducted for Models 1 and 5C (tee Table 4). In test' I, the most demanding 
test, every parameter estate for time 1 was constrained to be equal to the 
corresponding parameter estimate for time 2 — factor loadings in LAMBDA Y, 
factor variances and covariances in PSI, and error /uniquenesses in THETA 
EPSILON. Progressively lesi demanding tests of invariance were examined such 
that eiemer.tr in TH£TA EPSILON (tests 2.3M), elements in PSI (tests 3 * 
4>, and elements in LAMBDA y (test 4) were not constrained to be equal over 
time, in test 4, the least restrictive model, no invariance constraints were 
imposed at all; the cni~souare and df for this model are the sum of the values 
determined separately for responses from time 1 and from time 2 (Table 3). For 
both Models 1 and 5C, test 1 produced poorer goodness -of -fits than did the 
less restrictive tests of invariance, though even here the fits were 
reasonably good and the TCs were over .90. In tests 2 and 5 where the 
elements in THETA EPS I LOW were not constrained to be equal, the goodness-of- 
fit indices differed little from test 4, and the ^£were all .96 or greater. 
These findings provide strong support for the invariance of factry* loadings, 
factor variances, and factor covariances over time. As described earlier, 
this finding is important in that it provides support for the comparison of 
pretest* and posttest scores, as well as providing further evidence of the 

t 

generality of the factor structure. 

Insert Table 4 About Here 
The interpretation «f the tests of invariance for Model 5C is straight- 
forward m that: a) the constrained models are able to fit tne data, and b> 
the goodness-of-f it indices for the' constrained models, except possibly for 
test 1, differ* little from the unconstrained model. However, the 
interpretation of the invariance tests for "Model 1 is more complicated. As 
with Model SC, the goodness-of-fit for'the constrained models differ, little 
from the unconstrained models. However, none of the tests of Model 1 provides 
nearly as goed a fit as do the tests of Model SC. This situation is 
illustrated by the fact that the TCs for the tests of Model 1 are very high, 
even though the other goodness~of-f it indices are much poorer than those for 
Model SC. For purposes of the present investigation, this situation is. 
interpreted to mean that the responses to the Rtft'ter items cannot be 
adequately explained .by a single score, but that if a single score is used *** 

then it is reasonable to compare it at timt? 1 and time 2. However, the ol 

O 

justification for the use of a single score rests in part on additional O 
theoretical considerations, and these will be discussed latter. CO 
t*Q &US2iQflS.LSQ Qt th§ Parameter Estimates.. CQ 
In order tp evaluate Model 5C it 'is important to examine the values of 
he actual parameter estimates (see Table 5). These estimates represent 
responses from both time 1 and t'.me 2 in that the estimates come from the test 

\ M 
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of invariance i«hcr- all the estimated parameters were required to be equal 
over time. A preliminary inspection reveals that no parameter estimates fall 
outside of their permissible limits - factor variances or error /uniquenesses 
that are negative,, or factor covariances that are larger than the product of 
the factor standard deviations (i.e., factor correlations greater than 1.0). 
By itself this observation provides only minimal support for the model, but 
such. violations are frequent in CFAs with complex factors and argue against 
the validity of the proposed model. The parameter estimates do not resemble 
t^ose from £FA studies, partlyjaecause the estimates were based on the item 
covariances instead of the item correlations, and partly because reference 
variables were used to define each factor. However, for each estimated 
parameter — those not fixed at a predetermined value ~ the LISREL program 
also provides a standard error of .the estimate that is helpful in the 
interpretation of the results. This standard error can be used to form a t- 
ratio and to infer statistical significance; if the ratio is more than 1.96 
the estimate differs significantly from zero. 

: Insert Table 5 About Here 

The t-ratios for all five factor variances (the diagonal of PSI) are 
greater than 3, indicating that each of the factors describes a significant 
portion of variance. An examination of the factor loadings (in LAMBDA Y) 
indicate-, that each item contributes significantly to at least one of the 
factors that it was Hypothesized to represent. sin some cases where an item was 
nypothe«ized to define more than one factor not all the loadings are 
statistically significant, but the first four factors are each represented by 
at least four items that are significant as well as by their reference 
variable,. The fifth factor, Academic Situations, is only defined by three 
items - a reference item and two other items - but the ,t-ratios for all the 
estimated factor loadings and the factor' variance are statistically 
Significant. Hence, even the Academic Situation factor is clearly identified 
though it would be preferable to have more items. The third factor, Success 
Via Personal Initiative, 'is defined by 13 items* but estimates for seven of 
these failed to reach statistical -significance. (This, was also he factor 
that produced the least agreement among raters in the preliminary exercise 
used to formulate the a priori model, and in earlier EFAs described in Table 
2.) An examination of the content of the items hypothesized to define the 
th:rd factor did not reveal any obvious differences between items with and 
without significant loadings (e.g., those written in the first and third 
person;. However, except for the reference variable, every item hypothesized 
to define this factor was also hypothesized to reflect one or more other 
factors in the a priori model, and had a statistically significant loading on 

* ? \S 
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at least on* other factor in the empirical findings. Hence, the difficulty in 
determining what items best define this factor nay be due to t«e design of the 
Rotter instrument. 

Tha 'factor covariances in PSI are all statistically significant, 
indicating that the .factors are correlated. When these covariances are 
transformed into correlations by dividing each by the product of the factor 
standard deviations that they represent, the correlations (see Table 5> vary 
from .2 to' .8. Only the correlation between General Luck and Success Via 
Personal Initiative is greater than .55, and these are the two factors with 
the greatest item overlap in the a priori model. The fact that many items ask 
respondents to select between one statement referring to luck and one 
referring to success via personal initiative means that it is difficult to 
distinguish between these factors on the basis of the Rotter items. 

Joreokog and Sorbom (1981) describe the modification index <MI) that is 
useful in the further evaluation of a model. For every parameter that is 
fixed or constrained, the MI provides a lower-bound decrease that would result 
in the chi -square value if that parameter were freely estimated. Joreskog and 
Sorbom (1981) suggest that the MI for a parameter should be at least 5 before 
a researcher considers modifying the hypothesized model; a change in the chi- 
square of 5 is statistically significant at p < .01, though the probablity of 
a type I error is grossly inflated when such a test is applied to a large 
number of parameters (also see Marsh & Hocevar, in press) 1984). In the 
present investigation, 72 of the 115 factor loadings for Model 5C were 
hypothesized to be zero indicating that a" particular item would not load on a 
particular factor, and the validity of these hypotheses can be tested with the 
MI for each of these parameters. In various tests of the invariance of the 
parameter estimates for Model 5C, up to 80 parameters were constrained to be 
equal, and each MI indicates how much the contraint of the . one parameter 
affected the overall chi -square. For Model 5C, Mis were less than 5 for all 
factor loadings, factor variances and factor covariances in all tests of 
invariance for data from time 1 and time 2. In test 1 of the factorial 
invariance of Model 5C where elements in THETA EPSILON were constrained to be 
equal over time, approximately 15'/. of the error /uniqueness terms — though not 
the one correlated error term used to represent the occupational items — were 
larger than 5. These results provide further support of the interpretation of 
invariance tests presented earlier, and also suggest that no item would have 
contributed significantly to a factor that it was not hypothesized to 
represent . 

Summary and I(DBli£§t 1.0ns 
The primary purpose of the present investigation was to determine the 
dimensionality ot the Rotter I-E scale by reviewing previous research and the 
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methodology, of that research, and by applying CFA procedures that incorporate . 
racerCtl' advances in the application of factor analysis. Though the scale was 
originally designed to be uni dimensional and it is typically summarized with a 
single score, nearly-all studies that have specifically examined its 
dimensionality have concluded that it is multidimensional . Empirical 
guidelines to the number of factors that are needed, the results of previous 
research, an a priori model based on the content of the Rotter items, and the 
results of the CFA suggest that the Rotter scale contains 5 or 6 
distinguishable factors. Though" many eariiec studies report only 2 or 3 
Motors, an examination of the basis for the decision to report so few factors 
suggests that additional factors were needed to adequately explain the 
responses to the Rotter items. 

CFA, unlike EFA, begins with an a priori model of the hypothesized factor 
structure. An a priori model, based on the content of the Rotter items, Model 
SC, hypothesized five distinguishable factors and incorporated a sixth factor 
through the use of correlated error /uniquenesses. CFA results from time 1' and 
from time 2 both indicated that this five-factor model was. able to explain 
adequately the responses to the Rotter items, and that hypothesized models 

i 

with fewer factors or a simpler structure did not fit the data as well. An 
examination of the parameter estimates generated by Model 5C indicated that 
each of the factors was reasonably well defined, was distinguishable from the 
other factors, and explained a significant portion of the variance. Every item 
contributed significantly to at least one of the factors that it was ' 
hypothesized to represent, and no item contributed significantly to a factor 
that it was not hypothesized to' represent. Tests of the invarxance of thw 
factor structure indicated that the factor loadings, factor variances, and 
factor covariances, though perhaps not error /uniquenesses, were invariant over 
the two administrations of the test. 

Comparisons among factor structures identified in previous research must 
be made cautiously because of the limitations inherent in the use of EFAs 
employed in those studies. In no other study known to the authors was CFA 
used to test any a priori model of the factor structure of the Rotter items. • 
Rather, support for a particular model was hased on an idiosyncratic, a 
posteriori examination of factor loadings derived from an EFA. A test of 
factorial invariance with CFA is the recommended statistical procedure, 
perhaps the only justifiable procedure, for comparing the factor structures 
from two different sets of responses based on: a) before-after scores for a 
single group of subjects; b) responses by randomly assigned treatment and no 
treatment subjects; c) responses by different subgroups within a larger 
population; and d) responses to an original set of items and responses to the 

17 
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same set of items embedded in the context of additional items. Hence, this 

4 

procedure provides a basis for examining many questions raised in this area of 
research. More importantly the comparison of mean scores in each of these 
comparisons, the basis of nearly all experimental and correlational research, 
rests on the rarely tested assumption that the factor structure for the 
different sets of responses is relatively* invariant. Consequently, the 
procedures described in the present investigation represent an important 
methodological advance over previous research — one that can be used in 
reanalyses of data from earlier studies to examine previously unanswered 1 
questions and untested assumptions. The apparent success of the a priori 
model also provides support for the classification of of items into categories 
by independent raters as one basis for derivirig a priori models in other other 
CFA studies where well -defined factor structures do not. exist. 

Rotter's intention was to design a general I-E measurs, though he 
recognized that there may be many distinguishable facets and situationally 
specific components of the construct. In order to accomplish this purpose 
hotter constructed items that reflected many different specific facets with a 
relatively small number of items, and the surprisingly small average 
correlation among the responses to the Rotter items — about *. 1 — suggests 
that he was successful. While it may be justifiable to use such an approach 
to infer a general component, one that is superordi nate to the specific 
components on which it is based, such an approach will not produce a 
uni dimensional scale. Well-defined uni dimensional scales are typically based 
on a set of relatively homogeneous items that result in responses that are 
substantially correlated — exactly opposite to the approach used by Rotter. 
Rotter '5 approach also complicates the identification of a clearly defined 
factor structure. Even though the total score may reflect a broad range of 
specific components, many of these components may be represented by a small 
number of items or even a single item. Ironically, this difficulty has led 
some researchers to conclude that only one, or at most a few, facets are being 
measured ty tire RoUer scale. The fact that only a few factors are 
sufficiently well defined by the Rotter items to be consistently identified in 
EFA studies does not mean that a few factors are able to explain responses to 
the items. Empirical approaches used to estimate the number of factors 
represented by the Rutter items, as well as the logic used in the construction 
of the scale, suggest that there are many facets. Indeed, it is the extreme 
heterogeneity of the Rotter items rather than their homogeneity that makes it 
difficult to identify the many facets that are apparently reflected in the 
total score. - • ^ 

Part of the confusion in research based on the Rotter scale stems from an< 
apparent ambiguity in how a general scale is defined, and this problem is not 
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specific to'iocus of control research. A similar predicament in %elf-concept 
research h*s been recent examined by Marsh (Marsh, 1984; Marsh & Shavelson, 
in press). Historical , self-concept was typically inferred from responses 
to a hodge-podge of self-referent items, and researchers' inability to 
adequately identify distinguishable facets of self-concept was t**;en to mean 
that self-concept wa? a uni dimensional construct representing a general s*lf~ 
concept. However, subsequent theoretical models (e.g., Shavelson, Hubner & 
: Stanton, 1976), and factor analyses of responses to instruments derived from 
these models (e.g., Marsh & Shavelson, in press; Harsh, Barnes & Hocevar^ in 
press), now provide strong support fnr the multi dimensionality of self- 
concept. In this approach, facets were hypothesized on the basis of a well- . 
articulated theoretical model, a sufficient number of it^ms were constructed 
to adequately define each separate facet, and then empirical tests were 
conducted to establish that the hypothesized factor structure was able to fit 
the responses and to examine other psychometric properties. ~ The advantages of 
such an approach are that the underlying facets are specifically hypothesized 
and are empirically testable, thus providing a clear basis for interpreting 
responses to the instrument. Also, external criteria are more accurately 
predicted by specific facets that are logically related to them than by a 
general total score. This approach has been used with apparent success in 
self-concept research, and it may also be useful in locus of control research 
(see Gurin et al., 1978). 

Even if the specific facets underlying a general I-E construct, and their 
structure, can be articulated and supported empirically, this may not resolve 
the issue of how a general I-E score should be inferred. Marsh (1984), again 
based on self -concept research, described three different approaches: a) the 
total score from an idiosyncratic set of specific items that may or may not be 
balanced with respect to the appropriate facets of the general construct} b) a 
higher-order factor, one derived from a higher-order factor analysis of 
relations among well-defined first-order facets that is consistwt with a 
hierarchical model of the general construct; or c) a super ordinate, relatively 
unidimensional component rf the general construct that is inferred from 
responses to items that ar e not specific any particular facet (in J&elf -concept 
research this. is sometimes referred to as self-esteem). Rotter's strategy 
appears to correspond to the first approach, though the second approach would 
seem to be psychometrically more defensible and perhaps more heuristic* 
However, some researchers (e.g., Cherlm & Bourque, 1974; Lange lc Tiggemann, 
1981) suggest that the general control facet identified in many EFAs may 
represent a general factor^like that suggested in the third approach, and this 
possibility deserves further consideration. 

10 
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In locus of control research, a* in other areas of research, it nay be 
ill-advised to infer a general, construct from a small number of items designed 
to represent many specific components. -First, the particular items are rarely 
based on a theoretical model that provides a basis for sampling specific 
components. Without such a model, there is no basis for evaluating whether the 
items reflect the general theoretical construct, or whether they are balanced 
in respect to the appropriate facets of the general construct. Second, even 
if the selection- of items is based on a well-defined model of the general 
construct, it is difficult to determine empirically whether these facets are 
reflected in responses to the relatively small number of items chosen to 
^represent* these facets. A better approach is to construct instruments to 
measure each of the facets derived from a theoretical model, and to use 
empirical procedures such as CFA to test if these facets are reflected in 
responses to the instruments.^ Once a well-defined set of lower order factors 
has been identified, hierarchical CFA can *e used infer more general facets 
from the lower" order facets (see Marsh, 19B4j Marsh & Hocevar, in press)* On 
the basis o^ such research it may then be possible to select a relatively 
small number of items that adequately 'reflect the general construct, though 
this possibility would require further research. 

What are the implications of this research for the status of the Rotter 
scale and future research? The. .present investigation clearly shows that the 
Rotter scale is multidimensional, as does. Dearly .all research that has 
examined this issue, and so theVe^is .iO' justification for the use of the 
Rotter scale as a uni dimensional construct. However, even though multiple 
facets are clearly identified, most are not sufficiently reliable to be 
considered separately — their practical application would require the 
construction of new scales that contain more items and items that are more 
ciearly related to the specific facet that each is\designed to measure. 
Perhaps, more in keeping with the original design Of the Rotter scale, its 
total score might be used as a higher-order general I-E measure that is based 
on an average of specific facets of the construct. However, the justification 
for this must reside in a theoretical model that specifies what the particular 
facets are, how they combine to form tee general construct, and how these 
characteristics are reflected in the sampling of items in the Rotter IE scale. 
Clearly, such empirical support does not exist for the Rotter scale, and it. 
seems unlikely that the particular mix of items that appears on any instrument 
would adequately satisfy these criteria unless such a strategy were used in 
the original construction of items. * 

Other potential problems for the use of the Rotter scale stem from the 
forced-choice format of the Rotter items. First, though net the focus of the 
present investigation, the assumption that internal ity and externality 
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represent bipoVar end, of a single continuum - that the correlation bet-en 
thee. two conetruct. approach.. -1.0 - is problematic. Second.' the fact that 
each ,te» i. comprised of two statements that typically refer to different 
facets of I-E «an» that the determination of its factor structure is undu.lv 
complicated (see Collins. 1974, Klockar. e varnu., 1975, 7„cker.an & 6.rb.^, 
1977). Only items from th. Political Control facet are generally coepris* of 
two statewnts fro. the same facet, and this .ay explain why this facet is 
nearly always identified as a separate facet. Finally, once the 
«ultidi.ensionality of th. construct is recognized, a host of new theor.tic«l 
issues about ho« st.te.ents are pal-*, to for. each ite. oust be examined. 
Log.cally. it would see. that both the internal and external state.ent. that 
compris. „ch ite. should represent the sa.e facet, though other strategies 
«ay be viable. The authors' p.rsonal recommendation is against th. use of a 
forced-choice format in locus of control research. However, even if other 
researcher, favor th. use of a forced-choice for. at, this mast be justified on 
the bas»s of furth.r theoretical and .mpi-ic.l developeent of the I-E 
construct* 

The research conducted by Rotter .ore than 20 years ago. a. su««rw.d io 
his 1966 monograph, and the I-E scale that was darived fro. that res..rch, 
represented an important basis for locus of control research. Th. sc.l. ha. 
been very heuristic in terw of the amount of research that it ha, .ti.ul.t,d 
Nevertheless, judged by current standards, the Rotter I-E scale is a poor 
measurement instrument, the internal consistency reliability of its total 
score is .ini..l. th. rationale for its forced-choice format is dubious, th. ' 
assumption of it. unidimensionality is clearly wrong, the distinguishable- 
facets of the scale cannot be inferred with sufficient reliability to be 
practically useful, and there is no theoretical basis for combining response, 
to the different facets to form a general I-E score. The continued reliance of 
locus of control research on this type of instrument is likely to imped, 
further development of the construct. parUcularly its multifaceted natur.. 
instead, researchers should develop systematic models of th. important fac.t. 
of the gonial I-E construct and, perhaps, a hierarchical ordering of th«. 
facets. Such a mod.l. should then be the basis for the construction of 
appropriate measurement Instruments. In this way, empirical result, can be 
used to support, refute, or revise both the "instrument and the theory. Th. 
ongmal research Mr* „„ ftottBr ano subsequent reSBarcn ^ ^ 
Rotter I-E scale provide one important basis for further research and 
instrument construction, but the continued use of the Rotter I-E scale is not 
recommended. 
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FOOTNOTES 

1 — The eigenvalue rule is uased on the number of principal components that 
have eigenvalues greater than 1.0 — the number of unrotated factors that are 
able to explain at least 4.3-// (i.e., 1/23 x 100% where 23 is the number of 
Rotter items) of the total variance in analyses of the Rotter items. Some 
early studies in this area, apparently in a mistaken interpretation of the 
eigenvalue rule, have applied the rule to eigenvalues resulting from the 
rotated factors and the common factor variance, and this approach has then 
been used in some subsequent replications (see Gurin, et al., 1978, for 
further discussion). There is no justification for this second approach 
offered in any of the studies where it is used, suggesting that it does' 
represent a misunderstanding of of the original rule,- and it results in a 
substantially smaller number of eigenvalues that are greater than 1.0 than the 
correct application of the eigenvalue rule.- The apparent confusion between 
total variance and common variance has also been a source of confusion in 
other aspects of this research as described in footnote 2. 

i 

2 — Rotter (1966; 1975) reported that in two factor analyses, conducted early 
in the development of his scale, most of the variance could be accounted for 
by a single factor; he reported a figure of 537. for one of the analyses but 
did not give details about how this figure was derived. However, several 
sources of evidence indicate that Rotter must not have been referring to the 
proportion of the total variance that could be explained by a single factor. 
First, his suggestion is completely out of the range of findings summarized in 
Table 2 — in 19 factor analyses a single factor factor explained no more than 
20% of the variance (median = 13%) and not even three factors were able to 
account for more than 35*/. of the variance. Second, the amount of variance 
that can be explained by a single factor is closely related to the average 
correlation among items or, equivalently, the coefficient alpha estimate of 
reliability for the total score. However, the average correlations among items 
in the two studies cited by Rotter, derived from the internal consistency 
estimates reported in his 1966 monograph, were .092 and .088, and are lower 
than values in the present investigation ( .096 for time 1 and .135 fcr time 
2). This finding argues that less than 14'/. of the variance in the two studies 
cited by Rotter could be explained by a single factor, and leaves in doubt the 
derivation of the figure actually reported by Rotter. Finally, in a reanalysis 
of the original correlation matrix from one of the original studies cited by 
Rotter, Prociuk (1977) found that the no more than 9% of the variance could be 
explained by th'* first factor rather than the 53% reported by Rotter. Prociuk 
suggested that the 53% represented the percentage of common variance and not 
the total variance that could be explained by a general factor. 
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Table 1 

The Nu*btr of Factor* in the Rotter I-E Scale 

Criterion Tt«e i Txae 2 

a 

Number ©♦ eigenvalues 

greater than 1.0 V 7 

Kyeber a* statistically 

significant factors as 4 5 

det&remed fey a eh* -square 

test 

u . 

Cat tears Scree Test 4 7 

a — the 23- eigenvalues «ere* 3.3$, 1.61, 1,38, 1.24, 1.92, 1.14, 1.10 
1. 0£, 1,03, 0.98, 0.95, 0.88, 0^.6, 0.60, 0.75, 0.70, 0.67, 0.66, 0.63, 
0,39, 0.52, 0.30, and 0.43 4or time 1; and 4.15, 2.00, 1.38, 1.3, 1.14, 
5. SI, i.09, 1.0, 0.94, 0.87, 0,81, 0.78, 0.74, 0.72, 0.69, 0,66, 0.62, 
0.S7* 0.54, 0.5O, 0.48, 0.46, 0.41. 

b — TMs test requires the researcher to plot a graph of the factor 
nusber against the eigenvalues, and to deter nine the the nueber of 
♦actors beyond *hich the plot levels off to fore a relatively straight 
Sine that is nearly horizontal. However , as is the case in this 
application, the determination of the precise point where this occurs 
js somewhat subjective and different interpretations of the sense plot 
»* possible (K&« S> Kueiier, 1978) * 
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Table 2 

Su«*ary of Result* from Previous Factor Analyses of the Rotter Scale 

a 

Number of 



Percent Variance Ex plained 
By Factors 



Content of the 
Identified Factors 



^escrijstions gf Abbreviations Used tg Char acter i je Factors.. 
GL * General luck, general control, fate. 
PC « Political control. 

SV - Success via personal initiative, personal control. 
IC * Interpersonal control, likability, social relations. 
AS c Academic situations. 
OS * Occupational situations. 
*» = when presented with one of the above indicates that the 
classification of the factor is questionable- 



Eigenvalues 



btufly 


1ft 


2nd 


3rd 


4th 


Sth 


6th 


GL 


PC 


SV 


IC 


AS OS 


• Greater 
Than i.O 


i A 
1 H 


« CSV 


9/* 


7% 








X 


H 




X 






D 
D 


16/. 


10/. 


8/. 








X 


X 




.X 






£ H 




10/i 


6% 








X 


X 










B 


16% 


9% 


8% 








X 


X 










3 A „ 


14% 


9% 


6% 








>: 


X 




X? 






li 
t$ 




8/. 


8% 










X 




X? 


X? 




4 A 


13% 


7% 


7% 


6% 


6X 




X 


X 


x? 


X 


X 




B 


16% 


7% 


7% 


6% 


65i 




X 


X 


X? 


X 


X 




5 A 


16% 


6% 










X 


X 










3 

6 A 


20% 


10% 










X 

x*> 


X 
X 


X? 


X 


X 




7 A 


1 1% 


V% 










X 


X 










B 


1 2% 


7% 










X 


X 










8 A 


13% 


7% 


7% 


5% 


5% 


5X 


X 


X 


X 


X 


X X 


6 


9 A 


0% 


5% 
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1 2% 


7% 










X 












10 A 

c 

11 A 


6% 


5% 


5% 


5% 


47. 




X 


X 


X? 


X 


X 


5 


97. 


8% 


5% 








X? 


x*> 




x^ . 






B 


7% 


10% 


77. 










X? 










C 


12% 


9% 


6% 
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" S&vt4 v tQtf SfJBii BlISCiEtigntt 

i ■ Abrahamson, Schludermann & Schludermann (1973): A ■ 120 university males; 
B * 113 university females. 



2 ■ Cherlin and Bourque (1974): A * 161 university students} B ■ 100 adults 

from a general population. Eigenvalues were presented for 6 factors, but 
the interpreted solution was eased on three factors. 

3 « Dixon, McKee & McRae (1976): A * 98 male university students: B * 123 

female university students. 

6 

4 « Gar r a & Widlak '1977): A = 244 Chicano undergraduate students: B ■ 203 

Anglo undergraduate students. 

5 s Laruje and' Tiggemann (1981): A « 277 first-year university students: B = 93 

of the original sample retested 26 months latter. 

6 * Little (1977): A ■ 418 undergraduates. 

7 » Mirels (1970): A ■ 159 male undergraduates; B ■ 157 female undergraduates. 

B » O'Brien 8e Kabanoff (1981): A = 1921 individuals in a normative sample. 
Factor analyses were conducted on responses by subgroups in this study, but 
these results were not presented in sufficient detail to be presented. 

9 - Viney (1974): A ■ 159 male adolescents; B «• 134 female adolescents. 

10= Watson (1981): A ■ 147 university students. 

11* Wolk~& Hardy (1975): A » 115 Black nursing students; B = 125 White 
nursing students; C « White education students. The authors did not 
attempt to label the empirically identified factors. 

a — Whenever possible, the percentage of variance is based nn the 
variance explained by a factor before rotation, but some studies either 
reported percentage of variance explained after rotation or did not 
indicate whether the reported value was obtained before or after 
rotation. Since rotation procedures tend to distribute the explained 
variance more evenly than do the unrotated factors, this may produce a 
slight underestimate of the percentage of variance that can be 
explained by a single factor. In determining variance percentages and 
the number of eigenvalues greater than 1.0, a factor that accounts for 
4.3V. of -the variance (i.e., 1/23 x 100'/. where 23 is the number of items 
in the Rotter scale) has an eigenvalue of 1.0. Thus the equation 
eigenvalue « variance explained/4.3'/, was used to compute some values 
that were not reported. 

b — the study did not report the percentage of variance explained by 
the unrotated factors, but Watson U981) reported that the four and 
five factor solutions explained 387. and 44.5"/. of the variance. 

c — Factor loadings for three factors were presented for each sample, 
but neither the author of the original study nor we were able to make 
unambiguous interpretations of the content of 'those factors. 

Note: Studies included in this table were limited to published factor 
analyses of the original Rotter scale. Not included in this table were 
the .results of studies where: a) the results are unpublished; b) the . 
actual factor analysis was not based on the 23 Rotter items; c> the way ^ 
in which subjects responded to the items was altered d) the Rotter Jar 

items were translated into a foreign language; or e) the results were i 

not presented in sufficient detail to determine the content of 
identified factors and the percent of variance explained bv these 
factors. 
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Table 3 

6oodness-of-f it Indices for alternative models 



Model 


df 


X 




X /df 




RMS 




BBl 




TLI 








Tl^ 


T2 


Tl 


T2 


Tl 


T2 


Tl 

* 


T2 


Tl 


T2 


0 


253 


854 


1307 


3.38 


5. 17 


.025 


.032 














1 


230 


392 


583 


1.70 


2.53 


.012 


.016 


.541 


.554 


.497 


.511 


2A 


229 


313 


384 


1.37 


1.68 


.010 


.012 


.633 


.706 


.594 


.675 


26 


227 


306 


383 


1.3'i 


1.69 


.010 


.012 


.642 


.707 


.601 


.674 


3A 


227 


310 


374 


.1.37 


1.65 


.010 


.011 


.637 


.714 


.595 


.681 


3B 


216 


294 


347 


1.36 


1.61 


.010 


.011 


.656 


.734 


.597 


.689 


4A 


224 


302 


345 


1.35 


1.54 


.010 


.011 


.646. 


.736 


,601 


.702 


4b 


207 


267 


287 


1.29 


1.39 


.009 


.009 


.688 


.780 


.618 


.732 


4C 


203 


247 


252 


1.22 


1.24 


.009 


.009 


.711 


.807 


.640 


.756 


5A 


220 


290 


336 


U32 


1.53 


.010 


.011 


.660 


.743 


.609 


.704 


5B 


202 


250 


272 


. 1.24 


1.35 


.009 


.009 


.707 


.792 


.633 


. .739 


5C 


201 


241 


252 


1.20 


1.25 


.009 


.009 


.71B 


.807 


.645 


.758 



X2 /df ■ The ratio of the chi-square to the degrees-of -freedom (df)* 
RMS » The Root Mean Square residual is a measure ot the average of* 
residual variances and covariances for the original measured variables 
<Joreskog & Sfirbom, 1981, p. 1.41) 

2 2 2 * 

TLI = The Tucker-Lewis index is: 1-C(X /df >/(X /df )3. The X /df 

n m 2 n 

is the chi-snuare/df 1 ratio for the null model while the X /df is the 

corresponding value for the model being tested (Bentler & Bonett, 
1980). 

2 2 2 
BBI = The Bentler-Bonett Index is: 1-C (X >/(X >]. The X and 
w , n m n 

X are the chi-square values for the null and tested models, 
m 

Note: Each model is defined by a number and a letter. The number refers to 
the number ,of factors hypothesized by the model. The letter refers to the 
structure: A indicates a simple structure where each item is allowed to 
load on only one factor; B indicates a complex structure in which some 
items load on more than one factor; C indicates a complex structure (as in 
B) but in addition some of the error/uniquenesses in THETA EPSIL0N are 
hypothesized to be correlated. Model 0, the null model, hypothesizes that 
all thi measured variables are uncorrected, and it is used primarily in 
defining the lower bound for the TLI and BBI. Model 5C differs from model 
5B only in that the two items referring to occupation (items 9 & 13) are 
allowed to have correlated error/uniquenesses in THETA EPSIL0N. 
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TaMe ; 4 ■ • 
* ^ 

Four Tests of Factorial Invariance for Models 1 and 5C 







Parameters Constrained 


■ o 




A 1 






Model 


Test To Be Equal 


df 


dm 

X 


2 

X /df 


TLI 


SB I 


TC 




\ 


»*. * * 














0 








2161 


4.271 








1 


1 
1 


i v pes : tc 

LY | raj It • 




1086 


2 IS 


497 




. TV/ 






1 V» , PC 




1000 


2 07 

6i vr 


S3 7 


• Jl J 






3 


LY. ' ' 


AGO 


997 


2 Cfr 

A. • V f 




sr i e 
. JlJ 


on 9 




4 


none 


460 


97A 




V 

Sain 




1 (\f\fi 
I • vUU 


5C 


1 


LY, PS, TE% 


477 


654 


1.37 


.697 


.679 


.904 




2 


LY, PS 


453 


561 


1.24 


.740 


.710 


.960 




3 


LY 


438 


540 


1.23 


* .750 


•7i i 


.972 




4 


none 


402, 


494 


1.23 


.771 


.712* 


1.000 


a — 


LV ■ 


LAMBDA Y (factor loadings), 


PS « PSI 


(factor variances 


and 



covariances) , TE ■ THETA EPSXLON (error /uniquenesses) 

b — TC • Target Coefficient. The TC, a measure of a constrained 

model to explain variance accounted for the corresponding 

2 2 2 2 

unconstrained model, isi I - C (X )/(X )3 where X and X stand 

uc -c uc c 

for the chi -squares for the model with no invariance constraints and 

thtf constrained model in which some elements are constrained Jto t) v e 

equal across the multiple groups. .: .. 

NOTE: For model l."the chi-square value for test 1 differed 

significantly <p <. from the values for tests 3 and 4, but no 

other differences were statistically significant. For model 5G the 

chi-squ^re for test i* was signif icantly.fcigher^ than for the other 

tests, while the chi-square ior test 4 was significantly smaller; 

tests 2, and 3 did not differ significantly from each other. 

Nevertheless, the goodness-of -f i t indices for both models provide 

support for the invariance of factor loadings, and for factor 

variances and covariances, though perhaps not the error/uniquenesses. 
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Table 5 ■ 
Confir utory Factor Analysis of Responses to the Rotter I-E Scale: 
The Ratio of Parameter -Estimates to Standard Errors for Model SC 

b 



Error/ 

a . Uniqueness 

Factor Loadings (LAMBDA Y) • (THETA 



Item 












EPSILOhU 




PC . 


sv< 


IC . 


;AS - 




1 


00 


1.52/. 33 


1.49/. 53 


00 


- 

00 v 


■ ^ 

. 19/-. ©1 


2 


-0.71/. 12 


,■ 00 


00 


. 10/. to 


00 • 


.v'17/,01 


3 


00 


00 


.49/. 17 


A P\ t m A 

-.42/. 14 


00^ * _ 


, ***** M W m 

.•23/. 01 


A 
♦t 


• 42/. 17 


A ft/ i i «f 

-0.26/. 17 


00 


00 


-1. 70/^65 


.20/ .02 


0 


00 


4 Q § ft /I 

• It/. 24 


-0; 06/. 32 


• 43/. 18 


00 


b • It/. 01 


6 . 


00 


00 


00 


1 » 00 


00 


19/. 02 


7 


00 


.67/. It 


• 06/ • 2? 


00 


00 


.21/. 01 


8 


00 


00 


00 


00 


< CAM T 4 

-1.51/. 31 


• 1 1/.01 


Q 
T 


00 


a 1/ / .ft 

-0. 76/. 18 


-0. 10/. 27 


00 


AA 

00 


a ft J « ys 

. 18/. 10 


10 


-1.62/. 19 


00 


00 


00 


00 


. 14/. 01 


11 


00 


-0.41/. 18 


.60/. 29 


00 


00 


. 15/. 01 


12 


00 ■ 


-0.48/. 16 


.59/. 26 


00 


00 


. 10/. 01 


13 


00 


.33/. 09 


00 


.20/. 11 


00 


. 11/. 01 


14 


1.72/. 23 


00 


.19/. 19 


00 


00 


. 14/.. 01 


15 


00 


1.00 


00 


00 . 


00 


. 17/. 01 


16 


0J) 


00 


-0.07/. 18 


.74/. 17 


00 


.20/. 01 


17 


00 


2. 07/. 55 


2. 54/. 91 


00 


00 


.15/. 02 


IB 


-1.51/. 20 


oo . % 


-0.07/. 17 


00 


00 


.14/. 01 


19 


00 


00 


00 v 


00 


1.00 


.07/. 01 


20 


. 16/. 13 


1.12/. 12 


00 


00 


00 


. 15/. 01 


21 


00 


00 


-0.23/. 19 ■ 


-0.79/. IB 


00 


.22/. 01 


22 


00 


00 


1.00 


00 


00 


.15/. 01 


23 


1.00 


00 


00 


00 


00 


.19/. 01 


Factor 


Variances & Covariances (PSI; Factor 


Correlations Above .Diagonal) 




GL 


PC 


SV 


IC 


AS 




6L 


.038/. 008 


.494 


-.478 


.206 


.376 




PC 


.025/. .004 


.066/. 010 


-.826 


.507 


.536 




SV 


-.016/. 003 - 


-.037/. 006 


.031/. 007 


-.270 


-.516 




IC 


.010/. 004 


.031/. 006 


-.011/. 005 


.057/. 014 .323 




AS 


.008/. 002 


.015/. 003 


-.010/. 002 


.008/. 003 .011/. 003 


a — see Table 1 for the definitions of the 


factors. 







b — these values represent the diagonal of th£ THETA EPSIL0N matrix. In 
addition, pna off-diagonal value was estimated to represent the 
correlation between the two occupational items (9 & 13). This parameter 



estimate was -.028 with a standard error of .006. 

NOTE: The parameter estimates are are from responses to time 1 and to tifte 
2 when all parameter estimates were constrained to be invariant across 
both sets of responses. All items are scored as they originally appeared 
on the Rotter scale in that none were reflected, and the orientation of 
each factor is an arbitrary function of the orientation of the rtftrtnct 
variable. Factor correlation^*, presented above the main diagonal in PSI» 
were n.,t used in tests of the model, but are presented to aid in thi 
interpretation of the findings. 
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;* -•->*■; •. Atoeftdix I 

A Priori Definition of Model I M&tr Simple and Complex Structures That 
Hypothesized 1 to 5 Factors To Exp'lain Response to Rotter Scale. 



Model 
1 

2A t B 



Factors 
General I-£ 
General Luck. 



Items In rSimpie i * 
Co(SBlfH : §tr y c^u£:e. : 
all items „ 
^%4,-5i 6,7. 8^,11,12 
*%Z, 15*, 16.; 17* 19,20, 21,-22 



Items In Complex 
Structure QqIv. 



Political Control 2, 10, 14, 18,23* 



3A It B • General Luck 



1,4,12,13, 15*, 17,20 



Political Control 2,10,14,18,23* 

a -. . 

Success Via PI 3,5,6,7,8,9,11,16, 

19,21,22* 

4A i B General Luck 1,4, 12, 13,15*, 17,20 

Political Control 2,10,14,18,23* 

Success Via PI 5, 7, 8, 9, 1 1 , 19, 22* 

% Interpersonal Rel 3,6*, 16,21 

5A, B, General Luck 1, 12, 13, 15*, 17,20 

& C Political Control 2,10,14,18,23* 

Success Via PI 5,7,9,11,22* 



4,20 

5,7,9,11 

4,20 

1, 12,14,17, J8 
5,7,9,11 

4,20 .^-^ 

1,3,12,14,16,17, 

18,21 

2,5,13 

4,3,7,9,11 
4,20 

1,3,8, 12,14,16,17, 
18,19,21 
2,5,13 



Interpersonal Rel 3,6*, 16,21 

Academic Situation 4,9,19* * 

a — Success Via' PI * Success Via Personal initiative 

* Indicates variable chosen to be the reference variable for each factor. 
It was chosen to be the item that best represented that factor and was most 
uniquely related to it on the basis of the * priori classification of items 
by the four raters as described earlier. 

Note: Item numbers, I to 23, refer to the Rotter items after excluding the 
filler items. The definition of each factor is based on the classification 
of items into categories as described earlier. For models' that specified 2 
- 5 factors simple and complex structures were tested. For simple 
structures each item was allowed to load on only one factor (indicated in 
the column labelled "Items in Simple & Complex Structure") and for complex 
^structures some items were allowed to load on more than one factor 
(indicated in the column labelled "Items in Complex Structure Only*). 
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